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Normal Distributions and a Non-Symmetric 
Correlation Inequality for Gaussian Measure 
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Abstract 

Let )i be a Gaussian measure (say, on R’ 5 ) and let K,L C R n be such 

that I< is convex, L is a “layer” (i.e. L = {x : a < (x,u) < b} for some 

a, b £ R and u £ R n ) and the centers of mass (with respect to fj) of 

K and L coincide. Then [i(K fl L) > [i(K ) • fi(L). This is motivated by 

the well-known “positive correlation conjecture” for symmetric sets and 

a related inequality of Sidak concerning confidence regions for means of 

multivariate normal distributions. The proof uses an apparently hitherto 

unknown estimate for the (standard) Gaussian cumulative distribution 

function: $(*) >1-/ 2 (valid for x > —1). 

3x+(x 2 +8) ^ 
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1 Introduction 


n \\ x \\ 2 ‘ 

Let n = n n be the standard Gaussian measure on R” with density e ~ 

(or any centered Gaussian measure on R”). It is a well known open problem 
whether any two symmetric (with respect to the origin) convex sets K\ and K 2 
in R n are positively correlated with respect to /z, i.e. whether the following 
inequality holds 

ii{Ki n k 2 ) > ^(k 1 )h(k 2 ). (l) 

Of course once (1) is proved, it follows by induction that the following formally 
stronger statement is true: 

n(K x n k 2 n... n k n ) > n{Kx)n{K 2 ) ... h(k n ) (2) 

for any convex symmetric sets K \, K 2 , ..., Kn in R" (the same remark applies 
to any class of sets closed under intersections). In the language of statistics, 

(1) and (2) can be viewed as statements about confidence regions for means 
of multivariate normal distributions (cf. Theorem 1A below). In some special 
cases (1) and (2) are known to be true. Pitt [P] proved in 1977 that (1) (hence 

(2) ) holds in R 2 . Also, if K\, K 2 ..., are symmetric layers in R”, i.e., sets 
of the form 


K r = {x € R n : | (x, Ui)\ < 1}, m G R", i = l,2,...,N, 

then (2) holds (note that in that particular case (1) doesn’t imply (2)). This 
was proved by Siclak [S] in 1967 and consequently is referred to as Sidak’s 
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Lemma. See also Gluskin [G] for a proof of Sidak’s Lemma. The proof gives 
in fact a version of (1) with K\ - an arbitrary symmetric convex body and K 2 
- a layer; (2) for layers follows then by induction. We show in Remark 6 of 
Section 3 how Sidak’s Lemma can be proved easily with the approach of this 
paper (an argument of this type seems to have recently occurred more or less 
simultaneously to several people). In 1981 Borell [B] proved that (1) holds for 
a class of convex symmetric bodies in R" with certain additional properties. 
Recently Hu [H] proved a correlation inequality for Gaussian measure involving 
convex functions rather than sets. See also [S-S-Z] for a historical survey and 
other partial results and [K-MS] for related results. 

Here we prove the following. 

Theorem 1. Let K C R n be a convex body and u £ R" \ {0} be such that 

/ ({x,u) - c)dp, n {x) = 0 
J K 

i.e. the centroid of K with respect to jj, n lies on the hyperplane 

H c = {x £ R” : (u , x) = c}. 

Let L = L{a 1 b ) = {x £ R" : a < (x,u) < 6} where a, b £ R are such that the 
centroid of L also lies in H c . Then 

/Xn{L- n L) > ^i n (RT) • [X n fLf 

It is clear that Theorem 1 formally implies an analogous statement with p, n 
replaced by any gaussian measure on R” (centered or not). In the language of 
“confidence regions”, Theorem 1 may be restated as: 
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Theorem 1A. Let X\, X 2 , • • •, Xn, Y be jointly Gaussian random variables and 
bi, b 2 , ..., a, b £ R be such that 

E(F|Ai < h, X 2 < b 2 , ■ . ■, X N < b N , a < Y < b) = E(Y|a <Y <b). 

Then 

P(Ai < bi, X 2 < b 2 , ■ ■ ■, Xn < 6jv, a <Y < b) 

> P(Ai < bi, X 2 < b 2l ...,X N < b N ) -P(a < Y < b). 

We point out that the discrepancy between the degrees of generality of The¬ 
orems 1 and 1A (general convex sets vs. “rectangles”) is only apparent: passing 
from rectangular to general parallelepipeds requires only a change of variables; 
a general convex polytope is a “degenerated” parallelepiped, and any convex set 
can be approximated by polytopes. 

Theorem 1 leads naturally to the following generalization of the “correlation 
conjecture” (1). 

Problem 2. If I\\,K 2 e R” are convex sets (not necessarily symmetric) such 
that their centroids with respect to p n coincide, does (1) hold? 

It is conceivable that the “equality of the centroids” hypothesis is not the 
most proper here and that it should be modified. However, we were led to 
that particular hypothesis while considering some variational arguments related 
to the original (symmetric) correlation conjecture (those arguments yield, in 
particular, an alternative proof of the two-dimensional case shown in [P]). Our 
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Theorem 1 is related to Problem 2 in roughly the same way as Sidak’s Lemma 
is to the original “symmetric” conjecture. 

Theorem 1 is proved in Section 3 (with proofs of some technical lemmas 
relegated to Section 4). In Section 2 we develop some of the tools necessary for 
the proof. They may also be of independent interest, in particular Proposition 
3 which gives an upper estimate on the tail of the Gaussian distribution that is 
sharper than the corresponding “Komatsu inequality” known from the literature 
(cf. [I-MK], p. 17; see also [Ba] for another type of estimate). 

Proposition 3. For x > — 1 

2 ,£ f°° _£ , 4 

-r < e 2 / e 2 dt < - r 

x + (x 2 + 4) 2 J x 3x + (x 2 + 8) 2 

The lower estimate in Proposition 3 is the other “Komatsu inequality” and 
is true for any x € R. The comparison of the upper estimate from Proposition 
3 with classical estimates is given in a table in Remark 4 in the next section. 

Acknowledgement. Research partially supported by authors’ respective grants 
from the National Science Foundation. The final part of the research has been 
performed while the authors were in residence at MSRI Berkeley. They express 
their gratitude to the staff of the institute and to the organizers of the Convex 
Geometry semester for their hospitality and support. 
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2 Preliminaries about Gaussian measure. 


We start with the 

Proof of Proposition 3. We follow the outline given in [I-MK] in the context 

of Komatsu inequality. Put q(x) = ee~ V dt and q+(x) = --- r - It 

is easily checked that g' — xg — 1 and somewhat more tediously verified that 
9+ < xg + — 1. Moreover, (e.g.) a direct calculation shows that g{x) < \ for 
x > 0. By considering the function h = g+ — g and its differential inequality 
h! <xh — 1 one gets (by the same argument as in [I-MK]) that h = g+ — g > 0 
on (0, oo, hence on [0, oo. Since lim x ^_i g+(x) = oo whereas g(— 1) is finite, it 
follows that g+(x) > g(x) also for x G (—1,0) (otherwise consider x € (—1,0) 
for which h attains its minimum). 

The estimate from below is shown in a similar way (and, anyway, it is not 
new). □ 


Remark 4. As was mentioned in the introduction, for x > 0 the upper estimate 
of Proposition 3 is sharper than the well known estimate of Komatsu who proved 
that for x > 0 

2 f°° _*£ , 2 

x + (x 2 + 4) 2 J x x+ (x 2 + 2)5 

(see [I-MK]). We give below the values of relative “errors” (rounded to two 

significant digits) given by the two upper estimates for some values of x ; we 

also list, for reference, the errors of the lower estimate. All of these were calcu- 
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lated using Mathematica and verified with Maple. Our estimate is clearly the 
tightest of the three and vastly superior to the other upper estimate. 


X 

Our 

Upper 

Komatsu’s Upper 

Komatsu’s Lower 

0 

.13 

.13 

- 

.20 

2 

.30 

• 10 -2 

.67 

• 10" 1 

-.17 

• io - 1 

4 

.20 

• 10" 3 

.25 

• 10" 1 

-.25 

• 10 -2 

6 

.27 

• 10 -4 

.13 

• 10" 1 

-.61 

• 10 -3 

8 

.59 

• 10" 5 

.74 

• 10" 2 

-.21 

• 10 -3 

10 

.17 

• IO " 5 

.48 

• 10 -2 

-.92 

• 10 -4 

20 

.30 

• io - 7 

.12 

• 10“ 2 

-.61 

• 10“ 5 

30 

.27 

• IO " 8 

.55 

• 10 -3 

-.12 

• 10 -5 

40 

.48 

• 10 -9 

.31 

• 10 -3 

-.39 

• 10 -6 

50 

.13 

• 10 -9 

.20 

• 10 -3 

-.16 

• 10 -6 


Relative errors of estimates for the “Gaussian tail” for selected values of x. 


The next result is a fairly easy consequence of Proposition 3. 


Proposition 5. Let, for x £ R, 


f(x) = 


fC 

JX 


" 2 dt 


Then 


(i) f{x) is an increasing convex function. 

(ii) x — f(x) is an increasing (to 0 as x —> oo) function. 
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Proof, (i) We compute 


f'(x) = (-- x --s— 

L e 2dt L e 2 dt 


Clearly /' > 0 if and only if 


— x > 0. 


f°° e“V dt 

JX 

If x < 0. this inequality holds trivially; if x > 0 the inequality holds e.g. by 


Proposition 3, as 


We next have 


3x+(x 2 +8)2 x ' 


f"(x) = ^ ,oo e~±dt) 2 -3xe^ e ~^dt + 2). 

(e 2 J e 2 dt) 3 Jx Jx 

Clearly f" (x) > 0 if and only if 


( x 2 — l)(e 2 / e 2 dt,) 2 — 3xe 2 / e 2 dt + 2 > 0. 


We put 2 = e X 2 f°° e * 2 dt and consider the expression above as a polynomial 
in z i.e. z 2 (x 2 — 1) — 3 zx + 2. As the roots of this polynomial are 


Zl/2 = 


3x ± (a; 2 + 8) 2 4 

2(a: 2 — 1) 3a; =F {x 2 + 8)i 


f” > 0 holds trivially for — oo < x < — 1, and holds for x > —1 if 


e 2 dt < 


3x + (x 2 + 8)^ 


which is true by Proposition 3. 

(ii) By the calculation from the part (i) 


(x — f(x)Y = 1 — (- 2 —) 2 + x - 2 —■ 

v ■' v ’> v r oo _ j, r oo _t± ,, 

Jx e 2 dt Jx e 2dt 



After putting z = —-——. the assertion (x — f{x))' > 0 becomes 

r e ~ dt 

J X 

1 + xz — z 2 > 0. 

As the roots of this polynomial are 

x ± {x 2 + 4)2 

z l/2 = - 2 -’ 

the inequality follows, as before, from Proposition 3. □ 


3 Proof of Theorem 1 

The proof of Theorem 1 is achieved in several steps. In the first step we use 
Ehrhard’s inequality [E] to reduce the general case to the 2-dimensional case. 
In the second step, based on (a rather general) Lemma 7, we reduce the 2- 
dimensional problem even further to a four-parameter family of “extremal” sets. 
The final step is based on a careful analysis of dependence of the measures of 
sets involved on these parameters and uses (computational) Lemmas 8 and 9. 

Let K and u be as in Theorem 1 and let Hq be the hyperplane through 0 
orthogonal to u. Without loss of generality we may assume that ||u ||2 < 1. For 
R put H t = Ho + t-u and let <p(t) = n n -i(Kr\H t ) and $(£) = /zi((—oo, a;]). 
By Ehrhard’s inequality [E], %p(t) = $ _1 (i p(t)) is a concave function. Therefore 
it is enough to consider the case n — 2 and, in place of K , sets K.,,, C R 2 of the 


9 



form 


K 4 , = {(x,y) € R 2 :y < ip(x)}, 


( 3 ) 


with u = e i and Hq identical with the y-axis, where ip is a concave, R - valued 
function. We will use the convention $(—oo) = 0, <f>(oo) = 1. It may also be 
sometimes convenient to specify the interval [A, B] = {x : ip{x) > —oo}. The 
assumptions about the centroid become 


/ (x — c)Q(ip(x))dy i(x) = 0= / (x — c)dy i(x) 

J R, J a 


and the assertion becomes 


$(ip(x))dy i(x) > / / dy\{x). 


/R 


( 4 ) 


( 5 ) 


Remark 6. With this reduction of the general case to the 2-dimensional case 
we can now give a quick proof of Sidak’s Lemma. As was indicated earlier, 
Sidak’s Lemma follows by induction from the “symmetric” variant of Theorem 
1, i.e. when L is a 0-symmetric layer (a = — b , b > 0) and K is a 0-synnnetric 
set (hence c = 0). After reduction to the 2-dimensional case, ip is a concave 
function that is symmetric about the y-axis (hence decreasing away from the 
origin) and one has to show that 

/ b poo pb 

®(ip(x))d/j,i(x) > / $(ip(x))d,yi(x) / dyi(x) 

-b J — oo ^ —b 

or equivalently 


f_ b <S>{ip(x))dyi(x) 
f- b dyi{x) 


> 


$(ip(x))dyi(x). 
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The above inequality holds because on the left we are averaging the function 
&(ip(x)) over the set where it is “biggest”, while on the right - over the entire 
real line. 

Actually it is not even necessary to use Ehrhard’s inequality for this proof of 
Sidak’s Lemma. What is really used is (a special case of) the Brunn-Minkowski 
inequality for Gaussian measure (this was pointed out to the authors by A. 
Giannopoulos) and the fact that the Gaussian measure is a product measure. 

Returning to the proof of Theorem 1, we show next that it is enough to 
prove inequality (5) for “extremal” ip's which turn out to be linear functions. 
The reduction to this extremal case holds not only for Gaussian measure on R 2 
but for a much more general class of measures on R 2 and is based on Lemma 
7 that follows. It will be convenient to introduce the following notation: if 
ip : [a, b] —> R, let 

Cip = {{x, y) : a < x < b,y < ip(x)}. 

We then have 

Lemma 7. Let ip : [a, b] > R be a concave function not identically equal to —oo 
and let is be a finite measure on R 2 that is absolutely continuous with respect to 
the Lebesgue measure. Then there exists a linear function ipo(x) = mx + h such 
that 

(i) u(C^) = u{C^ o) 

(ii) /cv y ) = fc+ 0 xdu ( x ’ v) 

ii 



(iii) ip (a) < ipo(a), 4>(b) < ip 0 (b) 

(iv) ip' Q {a) < tp'(a), ip' 0 (b) > tp'(b) 


We postpone the rather elementary proof of Lemma 7 until section 4. 

For a < (3 let us denote 

L(a,(3) = {( x,y) £ R 2 : a < x < /?}. 

In the notation of Lemma 7 the assertion of Theorem 1 (or (5)) then becomes 

v{C$) > v{K$) ■ v(L(a,b)). (6) 

(Note that fl L(a, b)\ the reader is advised to draw a picture at this 

point to follow the remainder of the argument). Let now ipo(x) = mx + h be 
given by Lemma 7. By symmetry, we may assume that m > 0. The plan now 
is to show that, for some (ultimately unbounded) interval [A, B\ D [a, b] and 


V’i(x) 


mx + h if x £ [ A , B] 
—oo if x [ A , B\ 


we have 

y{C^) = i'(CV) 
> y(K^) 


( 7 ) 

( 8 ) 
(9) 
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while, at the same time, the ^-centroids of and K t i, lie on the same line 
x = c, i.e. 


/ (x—c)dv(x,y) = / (x—c)dv{x, y)= / (x—c)dv{x, y) = 0. 

JJ {A<x<B,y<.mx-\-h} JK^ 

( 10 ) 

It will then follow immediately that it is enough to prove (6) with ip replaced 
by ip\ , as required for reduction to the “linear” case. 

Now (8) is a direct consequence of the assertion (i) of Lemma 7 and (7). On 
the other hand, it follows from the assertions (iii) and (iv) that ipo(x) = mx+h > 
ip{x) for x [a, 6]; in other words K^, 0 \L(a,b) D K^,\L(a,b). In combination 
with (8) this would imply (9), if we were able to set [A, B] = [— oo, oo]. However, 
since we also need to ensure the centroid assumption (10), we need to proceed 
more carefully. Let A 0 < a (resp. B 0 > b) be such that 


/■ 

J K^ Q nL(Ao,a) 


c)dv(x,y) 


/ < 

J K^nL( — oo,a) 


c)dv{x,y). 


( 11 ) 


(resp. L{b , oo) and L{b , B 0 ) in place of L(— oo, a) and L(A 0: a)). This is possible 
since c £ (a, b) and, as we indicated earlier, tpo > ip on (—oo, a) (resp. on ( b , oo)). 
Since, by (i) and (ii) of Lemma 7, 


/ (x- c)dv(x,y) = f 


{x - c)dv{x,y ), 


it follows that the centroid condition (10) is satisfied if we set [A, B] = [Ho, Bo], 
Additionally, an elementary argument shows that (11) combined with ipo > ip 
on (—oo, a] implies 


v(K^ 0 n L(A 0 , a)) > v(K^ n L(- oo, a)). 
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This is roughly because the set on the left is “closer” to the axis x = c than 
the one on the right and so, for the “moment equality” (11) to hold, the former 
must have a “bigger mass”. Similarly, f(K^ 0 nL(b,B 0 )) > f) L(b, oo)), 

hence the “mass condition” (9) also holds with [A, B] = [.A 0 ,.Bo]- This reduces 
the problem to linear functions (more precisely functions of type (7)); to get 
the full reduction (i.e. to an unbounded interval [A, B ]) we notice that we may 
simultaneously (and, for that matter, continuously) move A to the left and B to 
the right starting from A 0 ,B 0 respectively so that the centroid condition (10) 
holds, until A “hits” —oo or B “hits” +oo; the mass condition (9) will be then 
a fortiori satisfied. 

Thus, depending on c, m and h, we end up with one of two possible config¬ 
urations 

Ri = Ri(h, B) = {(a;, y) G R 2 : —oo < x < B,y < mx + h} 

Rn = R 2 {h, A) = {(a;, y) € R 2 : A < x < oo, y < mx + ft}, 
for which we have, for i = 1 or * = 2 (whichever applicable), 

Hz(Ri n L(a, b)) = (Cy>) = ^ 2 {K^ n L(a, b )) (12) 

M2 (Ri) > M2(A» (13) 

/ (ar — c)dy 2 = / (x — c)dy ,2 = 0 (14) 

JRi Jk^, 

The three conditions above are just a rephrasing of (8)-(10) for v = /a 2 ; in 
particular it is enough to prove Theorem 1 for the extreme configurations K = 
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Ri,i = 1,2 or, equivalently, to prove (5) for if) = with i[>i given by (7) and 
some unbounded interval \A,B\. This will be the last step of the proof of the 
Theorem. 

Let us note here that even though for the configuration Ri = Ri(h, B) it is 
possible in principle to have the centroid condition (14) satisfied also for B < b, 
we do not have to consider that case as it would have been “reduced” in the 
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Lemma 9. With B = B{h) defined by (If) we have 


dF 1 

~dh 


(h, w) > 0. 


Proof of Lemma 8. We give the proof for B (hence Ri(h, B))\ A and f ?2 are 
treated in a similar way. Showing that ^ > 0 for fixed m and c is equivalent to 
showing that 4% < 0 for fixed B and to. Note that the centroid of Ri(h, B) is a 
“weighted average” of the centroids of the half lines y = mx + h, —oo < h < h, 
—oo < a : < B. Therefore to show that 4# < 0 it is enough to show that 
the x-coordinates of the centroids of the halflines move further away from the 
line x = B as h increases. We make a (orthogonal) change of variable such 
that the line y = mx + h becomes horizontal. Denote the new variables by 
(u,v). Showing that the x-coordinates of the centroids of the halflines move 
further away from the line x = B as h increases is equivalent to showing that 
the w-coordinate of the centroids of the half-lines move further away from the 
corresponding value U = U(h) on the line corresponding to x = B. This means 
that one has to show that 


U — 


f U te *2 dt 

J —OO 

(2tt)5$((7) 


increases as U increases, which holds by Proposition 5 (ii). 


□ 


The computational proof of the Lemma 9 is somewhat involved; we postpone 
it until the next section. 

With Lemmas 8 and 9 we can conclude the proof of the Theorem. Let us start 
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with several observations concerning the qualitative dependence of the regions 
Ri on c and h (for fixed m > 0; m does not qualitatively affect that dependence 
as long as it is positive, the case m = 0 being trivial). These observations are 
only partly used in the proof, but they do clarify the argument nevertheless. 
First, if c < 0 (the special role of 0 follows from the fact that the origin is the 
centroid of the entire plane), then only configurations of type R\ appear. As h 
increases, B = B(h) increases (by Lemma 8) and, as h —> +oo, B approaches 
some limit value B (of which we may think as B( oo)) defined by the equation 

f B 

/ (x - c)dn 2 = 0 

J — OO 

(cf. (14)). It can also be shown that as h—* —oo, B(h) approaches c, but that 
has no bearing on our argument; we do use only the fact that, for fixed w and c 
(hence a, b), the condition B(h) > b (on which we insist, see the remark following 
(16)) is, again by Lemma 8, satisfied for h in some interval (of the type [h*, +oo] 
if c < 0). If c = 0, the picture is similar except that B( oo) = oo. Finally, if c > 0, 
B(h) is also increasing with h, except that it reaches the limit value B = +oo 
for some finite h = h, at which point the configuration R 2 “kicks in”, the half¬ 
plane Ri(h, + 00 ) coinciding with R 2 (h , — 00 ). As h varies from h to + 00 , A(h) 
increases from —00 to some limit value A = A(oo) defined by /1° (x — c)dqi 2 = 0, 
the limit set # 2 ( 00 , A(oo)) being the half-plane {(x,y) € R 2 : x > A(oo)}. 

We first treat Ri(h, B) when c < 0. By Lemma 9, > 0 for all w. Hence 

we are done in this case if we show (15) for the extremal configuration when 
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h = +oo and B = B. But then 


i?i(oo, B) = {(x,y) : x < B} 


and hence 

Fi(h, w) < Fi(oo, w ) = /zi(—oo, B) < 1 

for all h , w. 

Next we consider Ri when c > 0. In this case Lemma 9 reduces the delib¬ 
eration to the extremal configuration with h = h (and B = +oo). Now, as we 
indicated earlier, 

Ri(h, +oo) = {(+ y) £ R 2 : y < mx + h} = R2{h, — oo) 


and so the inequality (15) will follow if we show (16) with A = —oo and the 
same values of c, m. Thus it remains to handle the case of R 2 i.e. we have to 
show that 

F 2 (h,w) < 1 


for all h, w or equivalently that 

y 2 {R2{h 1 A) n L(a,b)) 
1x2 {L(a,b)) 


> /X2(l?2 (h, A)) 


(17) 


for all h, w. Now let us fix h and w (in is fixed throughout the argument) and 
vary A (hence c). The right hand side of (17) is clearly largest if A = — 00 . 
Similarly the left hand side is smallest if A = — 00 ; this follows from the fact 
that, as A is decreasing to — 00 , c also decreases and consequently L moves to 
the left so that ^ 2 (^ 2 {h,A) n L(a,b)) decreases. So also in the case of i? 2 we 
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reduced the argument to the extremal configuration with A = —oo and h = h. 
It remains to show that 


IZo ^ mx + h Wi( x ) 


< 1. 


(18) 


Ja^irnx + h)^^ 

Throughout the remainder of the proof we will occasionally relax the assumption 
that c is the Gaussian centroid of (a, b). We first treat the case h> 0. Observe 

/ d 

<&(mx-\-h)dfj, i ( x ) 

mat m mai case —-—^--decreases as d increases for d > 0 (this is 

f_ d d M x ) 

seen by computing the derivative with respect to d). Therefore 


<&(mx + h)d/j,i(x) < 


f_ d ‘l*(nix + h)d[ii(x) 
f- d dni (x) 


The above is just (18) for a = —b. It now formally follows that (18) holds 
whenever > o (or b > —a): just compare the average of &(mx + h) over 

[a,b] with that over [—|a|, |a|] and use the fact that <f>(ma; + h) is increasing in 
x. In particular, if c is the Gaussian centroid of ( a,b ), then, as is easily seen, 
> c > 0, which settles the case h> 0. 

It remains to handle the case h < 0. 

Let $o = ii 2 ({(x,y) : y < mx+h }) and h 0 = $ _1 (d>o) (i.e. $0 = y 2 ({(x,y) : 
y < 0 • x + ho})). We need to show that 

[ $(lM + h) ~~~jy~~TTT > < l>0 = ®(ho). (19) 

Ja Mi((a,o)) 


Let Xq = h °~ h be the x-coordinate of the point of intersection of the lines 
■y = ho and y = nix + h. If a > Xo, then (19) holds trivially, hence we only need 
to consider the case a < x q. We will show that (19) holds provided > x 0 . 
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Figure 1: The case h < 0. 

In our situation (i.e. when c is the Gaussian centroid of (a, b)) this condition is 
satisfied since > c > xq- Similarly as in the case of h > 0, it is enough to 
consider the case = Xq or b — xo = %o ~ a. To show inequality (19), it is 
then enough to show 

pxo r-b 

/ ($o — $(ma' + h))d^i{x) < / (&(mx + h) — ^o)df-ii(x) (20) 

J a J xo 

or equivalently, by rotational invariance of the Gaussian measure, that 

fxo(®( mx + h ) - ^o)dm{x) 

+ + h ° + (1+m T /2) ( x 0 - «)) - *o)dm(x) (21) 

< fx 0 (®( mx + h ) - d’o )dfii{x), 

where aq = x 0 + ^^ 2 ^ 1 / 2 ) and x 2 = x 0 + (1 + m 2 ) (1/2 \x 0 - a) (see Figure 1). 

Inequality (21) holds, if we can show that 

f X2 . ,x 0 -x , (l + rn 2 )^ 1 / 2 ) . . . 

/ (<F(- \-h 0 + - - {xq — a)) — §o)d/-ii{x) 

Jb m TO 

f b , . . £0 - 2 ; (l + m 2 )^ 1 / 2 ) 

< / (<h(?na; +/i) - <F(-h ft. 0 + -- (xo - a))d/ui(x), 

J Xl to m 

which holds as the triangles over which we integrate have the same Lebesgue 

measure whereas the latter has bigger Gaussian measure as the (restriction of 
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the) reflection which maps the first one into the second is “measure decreasing” 
with respect to the Gaussian measure. 

As shown before, this also completes the proof of fq < 1 and consequently 
that of the Theorem. □ 


Remark 10. We wish to reiterate that, at least in the case when K is a half 
plane {{x,y) : y < mx + h}, the requirement that c is the Gaussian centroid 
of (a, b) may be relaxed somewhat: to > 0 if h > 0 and to > xq if 
h < 0. It follows that the same is true for regions of type R 2 . There is also 
some flexibility in the handling of regions of type i?i, and consequently of an 
arbitrary K. However, since we do not have any natural description of the 
allowed “relaxation”, we do not pursue this direction. 

4 Proofs of the Lemmas. 

Proof of Lemma 7. We shall tacitly assume that the density of v with respect 
to the Lebesgue measure is strictly positive, which is the case we need in our 
application; the general case can be easily derived from this one. We shall also 
assume that ip doesn’t take the value +00 , in particular ip is continuous (the 
opposite case is easy to handle directly) and that ip is not linear (if it is, we are 
already done). For m € R let the line ip^ m \x) = mx + h be such that 

v({(x,y) : a < x < b, y < ip(x)}) = v{{{x,y) : a < x <b,y < ip {m) (x)}), (22) 
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where h = h(m)', it follows from our assumptions that h(-) must be a continuous 
function. The graph of ip( m ^ cannot be completely above the graph of ip on (a, b) 
nor completely below the graph of ip on (a, b); otherwise the “mass equality” 
(22) would not hold. Therefore all the lines satisfying (22) intersect the graph 
of ip in at least one point (p, ip(p)) with a < p < b. 

Now suppose there is a line ipo{x) = inoX + h for which (22) holds, for which 
the “moment equality” 


xdv = 


xdv 


' {(x,y):a<.x<.b,y<.'ip(x)} 


' {(x,y):a<x<b,y<'ipo(x)} 


(23) 


holds and which has exactly one point of intersection (p,ipo(p)) with the graph 
of ip. Then ip < ipo on one of the intervals [a,p], [p, 6] and ip > ipo on the other. 
On the other hand, it follows from (22) and (23) that 


/ {x — p)dv = / (x — p)du, 

J {( x,y):a<x<b,y<'ijj(x )} J {(x,y):a<x<b,y< / ipo(x)} 

which is inconsistent with the preceding remark if ip and ipo are not identical. 
Consequently, the line y = ipo{x) with the required properties (22) and (23) has 
to intersect the graph of ip in at least two points (p\,ip(pi)), (j> 2 ,ip(p 2 )) with 
a < pi < P 2 < b and, by concavity of ip, in exactly two such points. Again by 
concavity of ip this is only possible if the assertions (iii) and (iv) of Lemma 7 
hold. 

It thus remains to show that among the linear functions ip^ for which the 
“mass equality” (22) (hence (i)) holds there is one for which also the “moment 
equality” (23) (hence (ii)) holds. To this end, observe that as m —> +oo, the 
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lines y = ip( m \x) “converge” to a vertical line x = ai, where a\ is defined by 
v(L(ai,b) = v(Cip). One clearly has 


xdv < 


xdv. 


JC^ J L(ai ,b) 

Similarly, as in —> —oo, the sets C^(m> “converge” to a strip L(a, bi) satis¬ 
fying f L ( a xdv < f c xdv. By continuity, there must be mo E R such that 
if o = ip( m °l verifies (ii). This finishes the proof of the Lemma. □ 

For the proof of Lemma 9 we shall need an elementary auxiliary result. 


Lemma 11. Let g be a convex function on an interval [a,(3\ and let p be a 
positive measure on [a,/?]. Let a', f3' be such that a < a' < (3' < j3 and suppose 


that 


Then 


(fa xdp(x) jf, xdp{x) 


P([<*,0\) 


n 7^Tv)(5(/3')-5(« , ))>0. 
P([a ,Pl) 


If g(x)dp{x) jf, g(x)dp(x 


(24) 


> 


P([a,0\) P([a',f3'}) 

Note that if, in particular, 0r if 9 ^ = 5( a '))> then 

the assertion holds. We skip the proof (the reader is advised to draw a picture). 


Proof of Lemma 9. We recall that by the comments following the statement 
of Lemma 9 (see also the remark preceding (15)), for fixed w and c (hence fixed 
a, 6), we do need to consider h* < h < h, where h = h* corresponds to B = b 
while h = +oo if c < 0 and h (< +oo) is defined by B{h) = +oo (or A(h) = — oo) 
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if c > 0. We have 


dF 1 

~dh 




u \ dn i(x) 
' Mi (a,6) 


+ ft) + e ~J.( mi+,, ) 2 (//j 1 (j) 


( 27r )^ (/! $ ( m:c + / 1 )^; 


;<)) 


f B Qo ®( mx + (x) e ^ mx+h)2 ^t$ 

(/a $ ( TO2: + h ) *HlS)) ) 

As S' > 0 by Lemma 8, > 0 will follow if 

/-oo > J" ^yHvW i(aQ 

ffoo&ivW/Hix) - f*$(y)diH(x) 

where y = mai + h. 

By Proposition 5, t/(y) = is a convex decreasing function (note that 

g(y ) = f(—y), where / is as in Proposition 5. Moreover 

f B oo x$(y) d yi(x) = /q xdyi{x) < x<f>(y)dy ipr) 

Ja d di(x) f* $(y)dtH(x) ’ 

as $(y) is increasing, and so the condition (24) is satisfied with dp(x) = $(y)dyi(x). 
Consequently Lemma 11 yields (25), completing the proof of Lemma 9. □ 
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